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ABSTRACT 

We investigate the prospects of extending observations of high redshift QSOs from the 
current z ~ 7 to z > 8 by means of a very wide-area near-infrared slitless spectro- 
scopic survey, considering as an example the planned survey with the European Space 
Agency's Euclid telescope (scheduled for a 2019 launch). For any QSOs at z > 8.06 
the strong Lyman-a line will enter the wavelength range of the Euclid near-infrared 
spectrometer and imaging photometer (NISP). We perform a detailed simulation of 
Euclid-NISP slitless spectroscopy (with the parameters of the wide survey) in an ar- 
tificial held containing QSO spectra at all redshifts up to z = 12 and to a faint limit 
H = 22.5. QSO spectra are represented with a template based on an SDSS com- 
posite spectrum, with the added effects of absorption from neutral hydrogen in the 
intergalactic medium. 

The spectra extracted from the simulation are analysed with an automated red- 
shift hnder, and a detection rate estimated as a function of H magnitude and red- 
shift (defined as the proportion of spectra with both correct redshift measurements 
and classifications). We show that, as expected, spectroscopic identification of QSOs 
would reach deeper limits for the redshift ranges where either Ha (0.67 < z < 2.05) or 
Lyman-a (z > 8.06) is visible. Furthermore, if photometrically-selected z > 8 spectra 
can be re-examined and re-fitted to minimize the effects of spectral contamination, 
the QSO detection rate in the Lyman-a window will be increased by an estimated 
~ 60% and will then be better here than at any other redshift, with an effective limit 
H ~ 21.5. 

With an extrapolated rate of QSO evolution, we predict the Euclid wide (15000 
deg 2 ) spectroscopic survey will identify and measure spectroscopic redshifts for a total 
of 20-35 QSOs at z > 8.06 (reduced slightly to 19-33 if we apply a small correction 
for missed weak- lined QSOs). However, for a model with a faster rate of evolution, 
this prediction goes down to 4 or 5. In any event, the survey will give important 
constraints on the evolution of QSO at z > 8 and therefore the formation of the first 
super-massive black holes. The z > 8.06 detections would be very luminous objects 
(with Mb = —26 to -28) and many would also be detectable by the proposed Wide 
Field X-ray Telescope. 
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1 INTRODUCTION 

Luminous QSOs, powered by supermassive black holes 
(SMBH) of the order of ~ 10 9 Solar masses (Mq) have been 
spectroscopically identified at z ~ 6-7, when the age of the 
Universe was < 1 Gyr, but their even earlier evolution and 
(necessarily rapid) formation is as yet unobserved. 

The luminosity density of active galactic nuclei (AGN) 
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increases strongly with redshift from z — to z ~ 2, but 
beyond z ~ 3 the comoving number density is seen to de- 
cline exponentially, approximately as <E>* oc e~ z in both 
optical and X-ray surveys (Schmidt, Schneider and Gunn 
1995, Fontanot et al. 2007a, Brusa et al. 2009). Even so, in 
the optical-spectroscopic Sloan Digital Sky Survey (SDSS), 
some QSOs have been identified to z ~ 6 (Fan et al. 2003, 
2004, 2006) with relatively bright apparent magnitudes of 
zab ~ 20, which at these redshifts correspond to extremely 
high optical-UV luminosities of Mb = —27 to -28. At fainter 
magnitudes (zab — 21-22) more z ~ 6 QSOs have been 
identified in the SDSS deep strip (Jiang et al. 2008) and 
the Canada-France High Redshift Quasar Survey (CFHQZ; 
Willott et al. 2010), in most cases with strong Lyman-a 
emission lines in their spectra, bringing the total of con- 
firmed QSOs at 5.74 < z < 6.43 to at least 40. 

Willott et al. (2010b) found another QSO (CFHQS 
J021013-045620) at z = 6.44, setting a new distance record. 
This was again surpassed when Mortlock et al. (2011) re- 
ported the discovery in the UKIRT Infrared Deep Sky Sur- 
vey (UKIDSS) of a very luminous and massive (M1450 = 
-26.6, M BH ~2x 1O 9 M ) QSO (ULAS J1120+0641) at 
z = 7.08 (at an age of the Universe 0.77 Gyr). Its SMBH is 
accreting at approximately the Eddington-limit rate, which 
for an efficiency e = 0.1 implies an e-folding time for SMBH 
growth of 45 Myr (Willott et al. 2011b). Clearly, to have 
grown up from a seed black hole of stellar (< IOOM0) or 
star cluster mass (~ 1O 4 M0, possibly even more) at ~ 1 
dex/0.1 Gyr, it must have formed very early and accreted 
at a maximal rate. The existence of such a massive and lu- 
minous QSO at z ~ 7 is already a challenge for formation 
models; yet, especially if seed black holes can be very mas- 
sive, there could be similar QSOs awaiting discovery at even 
higher redshifts. 

Models for QSO number counts at z > 6 may dif- 
fer widely. For example, in the models of Fontanot et al. 
(2007b), if the observed exponential negative density evolu- 
tion (as at 3 < z < 6 can simply be extrapolated to 
z > 6, it is likely that in the near future some QSOs will be 
found at z > 8. However, if all QSOs at z > 6 are accret- 
ing at the maximum possible rate (Eddington luminosity 
and maximum duty cycle of 1), this implies a much faster 
rate of evolution, so that QSOs completely disappear from 
detectability at z ~ 7.5. 

More recently, Shankar et al. (2010) interpreted the 
strong clustering of z ~ 3-4 QSOs as evidence they reside 
in extremely massive halos and further that the duty cycles 
are high and increase to approach unity at z = 6. As such 
massive halos would become very rare at z > 6 and the 
duty cycle could not further increase, they predict that the 
negative evolution of the QSO density (^-) would steepen 
markedly (by a factor ~ 2) at z > 6. Again this greatly 
reduces the detectable numbers at even higher redshifts rel- 
ative to the extrapolated model. However, this is not a cer- 
tainty; the statistical errors on the clustering (ro) are large 
and the interpretation (in terms of halo mass, duty cycle 
etc.) is model dependent. 

Trakhtenbrot et al. (2011), by measuring Mgll line 
widths for z ~ 4.8 QSOs, find their SMBH masses to average 
almost 10 9 M©, with high (median 0.59) but widely spread 
Eddington ratios. All these SMBH must have at times ac- 
creted very rapidly to have grown to these masses within 



the < 1 Gyr available. Yet the moderate evolution of SMBH 
masses over 2.4 < z < 4.8 could only be fitted by assum- 
ing low duty cycles of 0.1-0.2. If some fraction of AGN had 
low duty cycles even at higher redshifts, their epoch of very 
rapid mass growth would be pushed back earlier. 

There are QSOs with the same high SMBH masses, but 
in smaller numbers, at 6 < 2 < 7. A scenario of very rapid 
(~ Eddington-limit) but intermittent (merger-driven?) ac- 
cretion, with diverse black hole growth histories, would again 
raise the possibility of finding some very luminous QSOs out 
at z > 8. However, as the e-folding time for growth is con- 
strained by the Eddington limit, the maximum redshift for 
detectable (very luminous) QSOs may set lower limits on 
the masses of the seed black holes (Willott 2011). Finding 
luminous QSOs at z > 8 or even 2 ~ 10 would have major 
implications for formation processes and favour models (e.g. 
Davies, Miller and Belovary 2011) with larger seed black 
holes (Mbh ~ 10 5 Mq), such as might form from the col- 
lapse of a nuclear star cluster. 

Another reason why spectroscopy of z > 6 QSOs is 
important is that these objects lie in the epoch of reion- 
ization, where (increasingly to higher redshifts) neutral hy- 
drogen (HI) in the intergalactic medium absorbs photons 
emitted by the QSO blueward of Lyman-a, especially at the 
redshift where they are shifted through this wavelength. Ev- 
ery QSO at these redshifts can be a useful probe of the HI 
content both near the source (e.g. Bolton et al. 2011) and 
along the line-of-sight. 

The UV and optical emission at A rest > 1216A is not 
affected by HI absorption, but at these redshifts it is shifted 
into the near-IR where ground-based spectroscopy is very 
compromised by the atmospheric emission, so there are great 
efficiency gains from space-based observation. Our aim in 
this paper is to investigate the prospects of identifying very 
high redshift QSOs in a space-based wide-area near-infrared 
slitless spectroscopic survey. We consider as an example the 
Euclid wide survey. 

The European Space Agency mission Euclid, now sched- 
uled for launch in 2019, will contain a Visible Instrument 
(VIS) and a near-infrared spectrometer and imaging pho- 
tometer (NISP) sensitive at 11000 < A < 20000A (Laureijs 
et al. 2011). The latter instrument will be used to perform 
a slitless spectroscopic survey of the greater part of the sky 
outside of the Galactic plane, at least 15000 deg 2 , over a 
period of several years. The principal target is emission-line 
star-forming galaxies at redshifts 0.67 < 2 < 2.07 where 
the Ha line will be detected. By measuring redshifts (to 
a(z) < 0.005) for tens of millions of these galaxies, the large 
scale structure will be mapped in detail over an enormous 
volume (see e.g. Wang et al. 2010). 

The survey is also expected to find > 10 4 type 2 AGN 
and ~ 10 6 type 1 AGN. The moderate spectral resolution 
of NISP, R ~ 250, should be enough to resolve the narrow 
Ha and [Nil] 6584 emission lines for a high proportion of 
the galaxies (their flux ratio is an important diagnostic of 
type 2 AGN and also metallicity sensitive) , and to recognize 
the broad emission lines characteristic of type 1 AGN. The 
broad lines are 'permitted' transitions including Ha, Mgll 
and Lyman-a (see Fig. 1). Hao et al. (2005) found the Ha 
FWHM of low-redshift AGN to be bimodally distributed, 
< 1200 km s" 1 for type 2 AGN and 1400-8000 km s" 1 for 
type Is. Other broad lines in type 1 AGN spectra are in the 
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same range (e.g. Sulentic et al. 2004; Shang et al. 2007), and 
in general greater than the FWHM of ~ 1200 km s _1 corre- 
sponding to the NISP instrumental spectral resolution. Eu- 
clid NISP could be especially effective for identifying QSOs 
at z > 8.06, where the strong Lyman-a line would enter its 
wavelength range. 

We perform detailed simulations incorporating the most 
recent instrument parameters, a model of the QSO luminos- 
ity function (LF) and its evolution, and a spectral template 
based on observed high redshift QSOs, and follow with anal- 
ysis of the simulated spectra. Section 2 describes the mod- 
elling of the QSO spectra and evolution, and Section 3 the 
set-up of the simulation. Section 4 describes the analysis of 
the spectra to obtain redshift and classifications and Section 
5 presents our results, in terms of a QSO detection rate as a 
function of redshift and magnitude. In Section 6 we predict 
the numbers and some of the properties of the high-redshift 
QSOs which might be found in the NISP wide survey and 
Section 7 is summary and discussion. 

Magnitudes are given in the AB system where itiab = 
—48.60 — 2.5 log F v (in ergs cm _2 s _1 Hz _1 ). We assume 
throughout a spatially flat cosmology with Ho = 70 km 
s _1 Mpc _1 , Qm = 0.27 and Q.a = 0.73, giving the age of the 
Universe as 13.88 Gyr. 



2 MODEL REPRESENTING THE QSOS 

2.1 Template Spectrum - Broad Lines 

To represent the intrinsic spectrum of all QSOs in the sim- 
ulation we use the composite spectrum of Vanden Berk et 
al. (2001), which was created from over 2200 QSO spec- 
tra from the SDSS. The input spectra cover the redshift 
range 0.044 < z < 4.789 and the full luminosity range of the 
QSOs meeting the selection criteria. The composite covers 
800-8555 A with a signal/noise peaking at > 300 but much 
lower nearer either wavelength extreme. 

Table 1 lists the 7 most prominent broad (FWHM ~ 
2000-6000 km s _1 ) lines in this spectrum, critical for iden- 
tifying the spectrum as a QSO and determining the red- 
shift. There are also narrow emission lines which have lower 
fluxes than these, the strongest being [OIII]5007 (A„ ac = 
5008. 24A) with relative flux 2.49; there are many fainter 
lines, see Table 2 of Vanden Berk et al. (2001) for a full 
list. Fig. 1 shows the spectrum in the Fa form; the best-fit 
spectral index is a„ = —0.46 (ax = —1.54) over most of the 
range, becoming redder at A > 6000A due to host galaxy 
contributions. 

At wavelengths shortward of Lyman-a at 1216A, and 
especially of Lyman-/3 at 1030A, the Vanden Berk composite 
shows a drop in flux due to absorption from neutral hydrogen 
(HI) in the intergalactic medium (IGM). Fan et al. (2004) 
and Jiang et al. (2007) compare spectra of high luminosity 
(Mb — —27) QSOs at z ~ 6 directly with this template and 
find good agreement, except the observed spectra fall off 
even more steeply in the blue wing of Lyman-a (making the 
line slightly narrower), because IGM absorption is stronger 
at z ~ 6 than at z < 4. The spectral break at A res t ~ 
1200A becomes almost 'total' e.g. Willott et al. (2010b) give 
a colour i-z> 3.07 for a QSO at z = 6.44. 

Hence for our simulation it is preferable and more re- 
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Figure 1. Composite spectrum for type 1 QSOs in the SDSS 
(from Vanden Berk et al. 2001: replaced at A < 1187A by the 
power-law from Tclfcr et al. 2002), plotted on a linear scale, as 
flux per Angstrom (Fa in arbitrary units). Prominent broad lines 
are labelled, and also the strongest narrow line [OIII]5007. 



Line Wavelength (A„ ac ) A Relative flux Width ax A 



Lyman-a 1215.67 100.0 19.46 

Si IV 1396.76 8.9 12.50 

C IV 1549.06 25.3 14.33 

C III] 1908.73 15.9 23.58 

Mg II 2798.75 32.3 34.95 

H/3 4862.68 8.6 40.44 

Ha 6564.61 30.8 47.39 



Table 1. Principal broad lines in the SDSS QSO template, with 
wavelengths in the vacuum 

alistic to begin with an unabsorbed QSO spectrum blue- 
ward of Lyman-a, and then to subject this to absorption, 
modelled as a function of redshift. At A rest < 1187A we re- 
place the Vanden Berk template by a power-law f v oc i/~ 1,76 
(fx oc A~ 0,24 ), as fitted by Telfer at al. (2002) to a mixed 
sample of low-z QSOs (selected to be unaffected by absorp- 
tion) at 500 < A < 1200. Additionally, the Lyman-/3 line 
(by far the strongest line in this range) is extracted from 
the Vanden Berk composite and re-inserted on top of the 
power-law. The resulting template, shown in Fig. 1, agrees 
well with the QSO composite in Fig. 4 of Telfer et al. (2002). 
The template is extended redward (to the K band) with the 
low-resolution mean QSO spectrum (the all-QSO average) 
from Richards et al. (2006), converted to fx and spliced to 
the Vanden Berk spectrum at A > 6900A. 

For this simulation, we have assumed this averaged tem- 
plate (in which the Lyman-a rest-frame equivalent width 
EW ~ 75A) can approximately represent high-luminosity 
QSOs, even at z > 8, provided we apply the appropriate 
IGM absorptions. This is an approximation as real QSO 
spectra are diverse and there are known examples of high- z 
QSOs with very weak or absent Lyman-a lines (e.g. Fan et 
al. 1999, 2006). Also, some z ~ 6 QSOs have Lyman-a lines 
which are strong but much less broad, including the lumi- 
nous J000552. 34-000655.8 (Jiang et al. 2008; originally Fan 
et al. 2004) as well as less luminous (Mb ^ —25.5) objects 
(Jiang et al. 2009). On the other hand, the lower- luminosity 
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QSO studied by Kurk et al. (2009) at z = 6.08 has a strong 
and broad Lyman-a (4350 km s _1 ; EW = 87A) even if its 
other lines are relatively narrow, and the highest redshift 
known QSO (Mortlock et al. 2011) has strong Lyman-a. 

Diamond-Stanic et al. (2009) investigate the Lyman-a 
emission lines of high-redshift SDSS QSOs and find a mean 
EW of 67.6A with a la range 40-102A. This 1-magnitude- 
wide range will introduce a corresponding scatter in the flux 
limits to which real QSOs can be identified but (as this is 
moderate and the distribution is centred at essentially the 
same EW as the adopted template) the total numbers of 
detectable QSOs should remain very close to our simula- 
tion prediction. But they also identified a population (20 
objects forming 6% of the QSOs at z > 4.2) with extremely 
weak or absent Lyman-a emission (EW < 15 A). These form 
an excess/tail above the EW distribution fitted to stronger- 
lined QSOs, impying that a few percent down-correction for 
a missed lineless fraction may still be needed for the pre- 
dicted QSO counts (Section 6.1). 

2.2 Absorption from Intervening Neutral 
Hydrogen 

Emission from A < 1216A in the QSO rest-frame will be 
redshifted through Lyman-a along the line-of-sight to the 
observer and will be subject to absorption if neutral hydro- 
gen is present. Shorter wavelengths will also be subject to 
absorption by Lyman-/? etc. The density of HI strongly in- 
creases with redshift. In our simulation, the intrinsic QSO 
template is subjected to absorption of optical depth r(A, z), 
following the model of Madau (1995). 

Photons blueward of Lyman-a, emitted from the QSO 
at redshift z q will be observed with wavelength X t s where 
A 6s < A a (l + z qso ) and \ a = 1216A. They suffer Lyman- 
a absorption along the line of sight at a redshift z a b s = 
Aobs/Aa — 1 (always z abB < z qso ), with an optical depth 
(Madau 1995) of approximately r a = 0.0036(1 + z aba ) 3A6 . 

There are further additive terms in r (of decreasing 
size) for wavelengths emitted shortward of Lyman-/3 (A^ = 
1026A), Lyman-7 and further lines in the series. Our model 
includes the Lyman-a, j3, 7 and 8 absorption terms as given 
by Madau (1995), ignoring the higher lines, but including 
the absorption of A em < 912A (beyond the Lyman limit) 
photons with the optical depth given by the more complex 
analytic expression in the footnote on page 21 of Madau 
(1995). Note the simulation is insensitive to the details of 
the model used for HI absorption, as its effects will become 
visible to NISP only at z > 8.06, where the model gives a 
very strong break with almost zero flux at A rest < 1200A; 
something already seen for QSOs at z > 6 (e.g. Willott et 
al. 2011b). 

Here the Madau (1995) absorptions are only applied 
in full up a distance A(z) = z qso — z a bs = 0.125 away 
from the QSO. This is to represent the ionized HII 'bubble' 
(Stromgren sphere) of several Mpc radius that a QSO forms 
around itself by its own ionizing-photon emission. The effect 
of this is to allow the light in the Lyman-a line itself, or 
most of it, to reach the observer, even if there is very heavy 
absorption of wavelengths immediately blueward of the lines 
(e.g. Becker et al. 2001). Carilli et al. (2010) study the ion- 
ization zones for a sample of 5.7 < 2 < 6.4 QSOs, plotting 
transmission vs. radius profiles (which appear approximately 



Gaussian), and define a zone radius as that where photon 
transmission falls to 10%. 

In our model, at A(z) < 0.125 from the QSO, the 
Madau model optical depth from each line is reduced to 
t = [{z qso -z aba )/0.125} 2 , which gives a Gaussian-profile de- 
cline in transmission versus radius (or equivalently, in wave- 
length shortward from the emission line). Transmission is 
100% at A resi = 1216A but, for a QSO at 2 = 6, falls to 
10% at a proper radius 7.3 Mpc (in agreement with Car- 
illi et al. 2010). At this radius, emission from the QSO at 
Arest = 1196A will be redshifted through Lyman-a, so this is 
approximately the rest-frame wavelength the observed spec- 
trum would show a cut-off (break). Fig. 2a shows the mod- 
elled transmission as a function of rest-frame wavelength for 
QSOs at different redshifts. 

Carilli et al. (2010) and Bolton et al. (2011) find the 
ionized zone radii tend to decrease with increasing redshift, 
a sign that the (late end of the) IGM reionization epoch has 
already been reached at 2 ~ 6. Our model's ionized zone 
radius decreases with redshift as, firstly, the proper radius 
corresponding to A(z) — 0.125 decreases with redshift, and 
in addition, the optical depth at A(z) = 0.125 increases 
rapidly, as (I + 2) 3 ' 46 , causing the radius where transmission 
is 10% to decrease more steeply (Fig. 2b). The ionized zone 
radius for the 2 = 7.085 QSO of Mortlock et al. (2011) is only 
1.9 Mpc (so if typical might suggest even faster evolution). 
In our model this radius is still 1-3 Mpc in the 8 < 2 < 10 
range of most interest for this simulation, consistent with 
evidence that > 1 Mpc scale ionized zones can already exist 
for galaxies at 2 ~ 8.5 (Lehnert et al. 2010). 

Fig. 3 shows the QSO template spectrum in the Lyman- 
/3 to Lyman-a range, at a range of redshifts, after applying 
the modelled IGM absorption. There will be almost zero 
flux observed at wavelengths shortward of Lyman-a for all 
2 > 7 sources. We expect the absorption to 'bite' increas- 
ingly into the blue wing of the line as redshift increases, 
making it narrower and more asymmetric in profile, but as 
the absorption (in this model) only ever affects the blue side, 
Lyman-a should still remain strong and detectable. 

2.3 Luminosity Function and Evolution 

The LF of AGN is usually represented by a double power-law 
form (Boyle et al. 2000), expressed in absolute magnitudes 
as: 



\ ' ' IQO. 4a(M-M') ^_ ^Q0.4/3(M-M*) 

The bright- and faint-end slopes are respectively 
parametrized as a and /3, with a being steeper. The AGN 
LF undergoes a strong evolution with redshift, which has of- 
ten been represented by a pure luminosity evolution model 
(PLE), e.g. the form M g = -22.18 - 1.44z + 0.315z 2 from 
Croom et al. (2009), in which the L* luminosity rises to a 
peak at z ~ 2.3 and decreases again beyond this. However, 
the PLE model is not the best-fitting model, as there is ev- 
idence that the evolution is luminosity dependent, with the 
flux from lower luminosity AGN peaking at lower redshifts. 
Two ways of representing this are (i) a model in which both 
L* and the bright and/or faint-end slopes of the LF evolve 
with redshift (e.g. Croom et al. 2009), and (ii) luminosity- 
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Figure 2. (a) Model (from Madau 1995) for IGM line-of-sight 
transmission as a function of wavelength in the QSO rest-frame, 
for different QSO redshifts, showing increasing absorption short- 
ward of Lyman-a; (b) The proper radius from the QSO corre- 
sponding to A(z) = 0.125 and where (in our model) the trans- 
mission of light being redshifted through the Lyman-a wavelength 
falls to 0.1 (10%), as a function of QSO redshift. 
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Figure 3. The QSO template after IGM absorption, plotted in 
rest-frame A and showing the Lyman- 0—ct range only, illustrating 
the marked effect of increasing absorption on the Lyman-a line 
profile and spectral break. 
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Figure 4. Our model luminosity function (B07 LDDE plus k ev = 
—0.47 density decrease at z > 2.7) in the B-band for Type 1 
QSOs at a series of redshifts, showing a rise from z = to z = 
2-3 followed by a steady decine to z = 12. Also shown is the 
Willott et al. (2010a) best-fit LF for QSOs at z ~ 6 (assuming 
B = 7711450 — 0.68). 



dependent density evolution (LDDE). In this paper we use 
the LDDE model as given by Bongiorno et al. (2007), which 
was fitted to type 1 AGN from the VIMOS VLT Deep Sur- 
vey (VVDS) and SDSS over the range 1 < z < 4. 

However, at higher redshifts (z ~ 4) both optical and 
X-ray data indicate more rapid negative $* evolution than 
given by this model. Schmidt et al. (1995) and Brusa et al. 
(2009) found observations of QSOs at z = 3.0-4.5 to be 
consistent with an exponential decrease in comoving num- 
ber density, $ oc e - 10 ( z - 2 7 ) or io- - 43 ( z - 2 ' 7 ) . Civano et 
al. (2011) found this same form to fit Chandra-COSMOS 
counts to even higher redshifts of z ~ 5.3. The LF analysis 
of Willott et al. (2010a) assumed slightly faster evolution 
of I0- 47z at z ~ 6 and Fontanot et al. (2007a) fitted the 
QSO counts at 3.5 < z < 5.2 with evolution e - 1 - 2 6(±o is) z = 
■^Q-o.55(±o.08)z . djfr erences between these estimates are 
probably only at the level of the statistical uncertainties. 

We therefore use the Bongiorno et al. (2007, here- 
after B07) LDDE model for < z < 2.7 only, and at all 
z > 2.7 assume density evolution with a rate $(Ms,z) = 
$(A'/ s ,2.7)10- 047(z - 2 7) , calling this model k ev = -0.47. 
Fig. 4 shows the resulting B-band LF at redshifts z = to 
2 = 12. 

For comparison, also plotted is the LF fitted by Willott 
et al. (2010a) to a sample of z ~ 6 QSOs, which is $* = 
1.14 x 10" 8 , a = -1.81, /3 = -0.5 and M B = -25.81 (as- 
suming B — mi45o — 0.68 as calculated from the template 
spectrum). At z = 6 the Willott et al. (2010a) LF is a little 
below our model, by 40% at Mb = —28, and it is also ~ 40% 
lower in normalization than the Jiang et al. (2009) LF, which 
was fitted to (high luminosity) z ~ 6 QSOs in the SDSS. The 
sample of Willott et al. (2010a) includes these same QSOs 
(so is not entirely an independent estimate) but it is more 
weighted to lower luminosity {Mb — —25.4) QSOs in the 
CFHQS. At this time more z > 6 QSOs are still needed to 
better constrain their LF at these redshifts; in Section 6.1 
we will consider both normalizations. 

Fig. 5 Shows the Corrections J observed — Brest, 

Hobserved — B r est and K b S erved — B r est calculated for our 
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Figure 5. Blue-band k-correction from z = to z = 12, 
and the corrections J observed — B rest , H obaerved — B rest and 
^observed ~ B res t, computed for the QSO composite spectrum 
(including IGM absorption). 
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Figure 7. Predicted counts of high-2 QSOs in A(z) = 0.5 inter- 
vals, shown on a log scale for a limit H < 23.0. Our model is com- 
pared with the non-evolving, exponential evolution (k ev = —0.54) 
and Eddington accretion models of F07b (for the 13b LF). 
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Figure 6. Our model redshift distribution N(z) for QSOs at 
limits H = 19.5 to 22.5. Also plotted (multiplied by 0.1 to fit 
on the same graph) is the H < 22.5 N(z) with a scaling up by 
10°' 58z , to greatly increase the proportion of high redshift AGN; 
this is the N(z) used for the simulation input catalog (Section 3). 



after F07b) for a deeper H < 23 limit. Fig. 7 shows our 
model QSO counts in A(z) = 0.5 bins compared with the 
F07b non-evolving, k ev = —0.54 exponential evolution (from 
Fontanot et al. 2007a) and Eddington-accretion models, all 
computed with the '13b' LF. They are very similar to the 
F07b exponential model. The non-evolving model gives a 
very much higher count (and is already excluded by data, 
so is shown for comparison only), while the Eddington ac- 
cretion model predicts a very steep decline with QSOs only 
visible to z ~ 7.5, even at this faint limit. This is an ex- 
treme Eddington accretion model in which the QSOs have 
constantly accreted at the Eddington limit since formation 
up to 2 = 6. i.e. with a maximal duty cycle of 1. 

Similarly in the 'reference model' of Shankar et al. 
(2010) the AGN duty cycle is very close to unity at z > 6 
and their numbers (at a given luminosity) decline almost 
twice as steeply as in our model. With a lower duty cycle 
the growth of QSOs is more protracted so that more are de- 
tectable at z > 8 and some might be visible at z > 10 (e.g. 
Fig. 10 of Rhook and Haehnelt 2008). 



model's QSO template spectrum (including the redshift- 
dependent absorption shortward of Lyman-alpha). Also the 
B-band k-correction, approximately —1.35 log(l + z) to z ~ 
4, beyond which absorption causes the source to drop out of 
B-band observations, while remaining visible in the near-IR 
bands to z > 10. For the simulation we consider primar- 
ily the apparent magnitudes in the //-band, 1.40— 1.80/Ltm, 
the passband most similar to the NISP spectral range. The 
colours between the NIR passbands remain small, at least to 
z ~ 10; e.g at z = 8.2, H = 21.0 corresponds to K = 20.90 
and J = 21.28. 

Fig. 6 shows the model's redshift distribution, N(z), for 
type 1 QSOs at a series of H magnitude limits. At all these 
limits the N(z) peaks at z ~ 1 and very high redshift QSOs 
remain very rare, e.g. to H = 22 the fraction at z > 8.06 
is only 0.0016% (although for a 15000 deg 2 area this still 
amounts to some 86 objects). 

We also compare with predictions for 5 < z < 12 QSOs 
tabulated by Fontanot, Somerville and Jester (2007b, here- 



3 THE SIMULATION 
3.1 Set-up 

We generate simulations of NISP slitless spectroscopy im- 
ages using AXESIM (Rosati et al. 2009; Franzetti et al., in 
preparation). The necessary inputs for the simulation in- 
clude the QSO template spectra and a catalog of QSO mag- 
nitudes and redshifts. We use a set of 61 templates to repre- 
sent the QSOs; they are identical at A > 1216A but short- 
ward of Lyman-a include differing levels of IGM absorption 
(modelled as described above) corresponding to A(z) = 0.2 
intervals from z — to z = 12. 

As we aim to estimate detection efficiency over all red- 
shifts, the N(z) used to set up the simulation catalog is 
modified from the (B07) model by a scaling factor 10 ' 58z , 
giving a greatly increased proportion of high redshift objects 
- almost a flat N(z) (Fig. 7) - while retaining the 'correct' 
apparent magnitudes (as given by the model) for each QSO. 
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As type 1 QSOs in the wavelength range of interest are typi- 
cally 85-90% dominated by the central AGN (e.g. Hutchings 
et al. 2002, Goto et al. 2009), all the QSOs are represented 
as point sources (their size on the simulation image is just 
the instrumental point-spread function). 

AXESIM generates a simulation image containing the 
spectra of the input catalog objects, exactly as would be 
seen by NISP using the most recent instrumental parame- 
ters (Laureijs et al. 2011). We simulate a single square area 
0.5 x 0.5 deg 2 , and on this are placed a total 2187 QSO 
spectra, with redshifts up to z — 12 and to the faint limit 
H — 22.5 (preliminary tests found no QSOs fainter than this 
would be detected). 

For the simulation to be realistic it must at least at- 
tempt to replicate the effect of spectral contamination by 
neighbouring spectra, a serious problem for deep slitless 
spectroscopy, which will reduce the numbers of spectra cor- 
rectly identified (i.e. survey efficiency). Contamination is in- 
troduced in part by the artificially very high surface density 
of high-redshift QSO spectra (2187; without the l0°- 58z term 
there would only be 120) and partly by adding spectra of 
non-AGN faint galaxies. The galaxy spectra are copied di- 
rectly from a 0.5 x 0.5 deg 2 area of our COSMOS-IRAC 
(Spitzer Infrared Array Camera) based simulation of galax- 
ies in NISP (Franzetti et al., in preparation), but because 
there is already a greatly enhanced surface density of AGN 
spectra, only every other galaxy spectrum from this model 
is added. This still amounts to 13023 faint galaxies on the 
frame, although only 5814 of these are brighter than the sim- 
ulation limit (K ~ 22.4) and only 501 have Ha emission lines 
above the nominal survey detection limits (0.67 < z < 2.05 
and F(Ha) > 3 x 10~ 16 ergs cm _2 s _1 ). This representa- 
tion of spectral contamination is inevitably approximate, as 
for one thing the emission-line properties of the simulated 
galaxies are significantly model-dependent, and because the 
simulated broad-linc QSOs and the galaxies differ greatly 
in spectra and redshift ranges and so cannot produce quite 
equivalent effects. But it should be good enough for a reason- 
able estimate contamination's adverse effects on QSO red- 
shift measurement and for of us to test a method of reducing 
this (Section 4.3). 

The total exposure time for the simulation image is 1600 
sec in total, which, following the currently planned wide sur- 
vey configuration is split into 4 exposures - the 4 combina- 
tions of two (blue-half and red-half) filters and two slit roll 
angles (0° and 90°). 

3.2 Extracting the Spectra 

After generating the simulation image, AXESIM then ex- 
tracts the spectra for each target object, at both roll an- 
gles, exactly as would be done for real NISP data (Kiimmel 
et al. 2009). The spectra each have 919 pixels covering 
11000 < A < 20000A with pixelsize 9.8A. We will here be 
considering only the QSO spectra. Because of the limited 
field size, many spectra are partially off the field edges. In 
total, because of geometric considerations, only 1468 of the 
QSO spectra could be extracted successfully (including 37 
for which we obtained only one of the two roll angles). How- 
ever, losses of this type will be far less significant in the real 
wide survey, in which Euclid will scan strips or patches with 
the fields-of-view mosaicked to cover much larger contigu- 



ous areas without gaps (Laureijs et al. 2011). Hence for the 
purposes of this paper we disregard the spectra that could 
not be extracted (because they are partially outside the field 
of view), and successful detection rates will be calculated as 
fractions of the numbers of extracted spectra. 

For each observed object ID spectra from the zero and 
90° roll angles were combined making use of a decontami- 
nation tool which is part of RESS (Redshift Extraction from 
Slitless Spectroscopy) an automatic software running within 
IRAF environment, which has been devised (Rossetti E. 
2011, in preparation) to reduce and analyze contaminated 
slitless spectra. RESS 2-measure is currently implemented 
only for low 2 galaxies (0.7 < 2 < 2.0) and its extension for 
high z objects is foreseen. The RESS decontamination tool is 
optimized to detect and exclude from the final combination 
the contaminated portions in each spectrum, when present. 
However, this is only sometimes effective where, as in this 
case, there are only 2 roll angles, and will be much more 
efficient for the deep fields where a greater number of roll 
angles will be utilized. 

Fig. 8 shows 6 examples of spectra extracted from the 
simulation (roll angles summed); these are brighter (H < 19) 
examples without significant contamination, in order to 
most clearly illustrate the spectral features visible at dif- 
ferent redshifts. 

4 ANALYSIS OF THE SIMULATED SPECTRA 

4.1 Method: Automated Redshift Finder plus 
Line Width Measurement 

For a survey of this size the spectra must (first) be analyzed 
with a fast and efficient automated redshift finder, and for 
our simulation we use the program EZ (Garilli et al. 2010), 
which is supplied with a large set of template spectra, includ- 
ing the SDSS-composite QSO spectrum and a wide variety 
of emission-line and absorption-line galaxies. EZ classifies 
spectra and fits redshifts by a procedure which combines (i) 
correlating spectra with templates (ii) refining the correla- 
tion with a x 2 fit (which can often choose the best amongst 
several redshifts where the correlation peaks) and (iii) emis- 
sion line finding and matching, only in cases where there are 
emission lines detected above a chosen threshold (by default 
5cr, which is retained here). 

For our analysis we add five more templates, the Vanden 
Berk et al. (2001) composite modified shortward of Lyman-a 
(as described in Section 2.1 and plotted on Fig. 1), and the 
same but with the IGM absorption (as modelled in Section 
2.2) appropriate to 2 = 2, 4, 6 and 8, giving an increasingly 
sharp and deep spectra break and asymmetric Lyman-a line 
(Fig. 4). Adding these templates is important, they improve 
the performance and accuracy of redshift measurement at 
redshifts where the Lyman-a is visible. 

EZ was run in batch (automated) mode on the full set 
of 1468 QSO spectra extracted from the AXESIM image, 
outputting for each a best-fit redshift and template (i.e. a 
spectral classification). To evaluate the results we count the 
numbers of spectra for which the input and best-fit redshifts 
agree within a chosen tolerance and, in addition, the spec- 
trum is correctly classified (i.e. the best-fit template is any 
QSO template, rather than e.g. a starburst galaxy). 
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Figure 8. Simulated spectra plotted as flux F\, units ergs 
cm _2 s _1 A _1 , against A in Angstroms, as extracted from the sim- 
ulation image (afer combination of the two roll angle spectra). 
These are six of the brighter (H < 19) examples at different red- 
shifts, with the strongest visible emission features labelled. The 
last two are at z > 8.06 where Lyman-a enters the NISP range. 



However, there is a known 'problem' with the current 
EZ configuration, that if multiple > 5a lines or any 'strong' 
(> lOo") lines are found, the spectrum is only correlated with 
emission-line galaxy templates. While this feature might be 
removed, this might lead to undesirable misidentification of 
some real emission-line galaxies. A better solution is to fol- 
low redshift-measurement by a further analysis to fit the 
width (FWHM) of the strongest emission line. Any spec- 



trum found to have a bright line of width FWHM > 2000 
km s _1 can be confidently reclassified as a type 1 AGN. It 
will also be useful to investigate here the limits to which 
line FWHM can accurately be measured with NISP data, 
as they can provide information on the AGN properties, in- 
cluding (at least in the case of Ha, H/5 and Mgll) the mass 
of the SMBH (e.g. Trakhtenbrot et al. 2011, Willott et al. 
2010b) 

The present version of EZ does not include line-width 
measurement, and so this analysis is performed with IRAF 
'specfit', fitting each spectrum with a simple two-component 
model - a power-law continuum and a single Gaussian line 
(with FWHM from 500 to 10000 kms -1 ). The returned pa- 
rameters (4) are power-law normalization and slope, Gaus- 
sian flux and FWHM, all with error bars. The line to be 
fitted in each spectrum is chosen as the brightest broad line 
visible (for 0.67 < z < 2.04, Ha; for 2.04 < z < 3.1, H/3; 
for 3.1 < z < 6.128, Mgll; for 6.128 < z < 8.1 CIV, and for 
z > 8.1, Lyman-a), and the Lyman-a fit was allowed to be 
skewed (in the range 0.8-1.3) and the other lines assumed 
to be symmetric. 



4.2 Results: Redshifts and Classifications 

For spectra such as these with broad lines and low signal- 
to-noise ratio, redshift measurements are likely to be less 
precise than for galaxies with strong narrow lines, and we 
allow here a wide tolerance of A(z) = ±0.05 for a redshift to 
be considered 'correct'. The scatter will be examined below. 

Examining the automated output for all 1468 analysed 
spectra, we can sort the results as follows: 

(i) spectrum is classed as an QSO with a correct redshift 
(148 spectra), 

(ii) correct redshift but classed as a non-AGN galaxy 
(20 spectra), 

(iii) incorrect redshift, classed as AGN (171 spectra), 

(iv) incorrect redshift, classed as a non-AGN galaxy 
(1129 spectra). 

A plot of A(z) — Zfu — Zinp U t against redshift (Fig. 
9) shows that that for spectra in the Lyman-a window 
there is a relatively large scatter between these two red- 
shifts. For spectra with correct (A(z) < ±0.05) redshift 
measurements in this z range (numbering 42) the scatter 
c{zfit — Zinp U t) = 0.0237 and this is also skewed, with a sys- 
tematic offset {zfu — Zinput) = 0.015 ± 0.004. This skewness 
must be due to the very asymmetric form of the Lyman-a 
line and break. In the Ha window the (68) spectra identified 
as AGN with correct z have a much smaller redshift scatter 
a ( z fit — Zinp U t) = 0.0026 with no systematic offset (as Ha 
is narrower and symmetric). 

Of the 20 spectra with correct redshifts, but incorrect 
classifications as non-AGN galaxies, none are at z > 8, most 
are at 1 < z < 3, and 7 are relatively bright with H < 
20 with an 8th only slightly fainter with H = 20.058. In 
2 of these 8 brighter spectra the misidentification was due 
to contamination distorting a region of the continuum (if 
these regions are excluded from the fit, the correct AGN 
classification is obtained). In the other 6 cases, EZ identified 
multiple emission lines above 5<r, which could be Ha, H/3 
and one or both of the [OIII] lines (5007, 4959). Many can 
be reclassified as QSOs using line width measurements. 

As a correct redshift is required beforehand in order 
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Figure 9. The difference between input redshift 2; n for a spec- 
trum and the redshift best- fitted by EZ, as a function of Zi„, show- 
ing the tolerance A(z) = ±0.05. Note the redshift-dependence of 
the scatter. 



to set an initial A for the line to be fitted, we perform the 
'specfit' analysis only on the 168 spectra for which EZ found 
a correct redshift. Fig. 10 shows examples of the specfit fits. 
Fig. 11 shows the measured FWHMs as a function of H 
magnitude for Ha, H/3, Mgll and Lyman-a lines. 

For Ha this procedure works very well: line widths can 
be measured accurately to at least H = 20.3, to which limit 
the mean FWHM is measured as 2923 km s _1 with scatter 
318 km s _1 . We can confidently reidentify as QSOs all 6 
of the misclassified spectra in this redshift range and above 
this limit. For H/3 we get accurate FWHM to at least H = 
19.3, at which limit the mean FWHM is measured as 3607 
km s _1 with scatter 587 km s , sufficient to reclassify two 
more of the spectra. For Mgll the limit for good FWHM 
measurement may be shallower but is at least H — 19.1 to 
which the mean FWHM is measured as 4863 km s _1 with 
scatter 871 km s _1 (the only misclassified spectra here are 
too faint to re-identify). In the Lyman-a window none of 
these spectra are misclassified but we can still test FWHM 
measurement. To H = 20.3, the mean is 4410 km s^ 1 but 
with a large scatter 1426 km s _1 ; a more sophisticated model 
may perform better due to the asymmetry and continuum 
break. 

By following redshift- finding with a measurement of line 
FWHM, we recover 8 more spectra into the 'correct' list of 
QSO identifications, giving a total of 156. 



4.3 Lyman-a Identifications, and the Gains from 
Refitting Spectra 

A key prediction of our simulation is that the ability to 
detect/recognize QSOs will greatly increase at the redshift 
where the strong Lyman-a line enters the spectrograph 
range (z > 8.06 for Euclid NISP). In our simulation, EZ 
in automated mode gave correct redshifts for 42 spectra 
at z > 8.06 (examples on Fig. 12). However, some fitted 
redshifts were wrong, even where the signal-to-noise should 
have been good enough and the Lyman-a was clearly visible, 
because of spectral contamination and overlapping. Often 
this affected only a fraction of the wavelength range, and if 
the bad region could simply be excluded from the fit, a cor- 
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Figure 10. Examples of IRAF Specfit fits to the spectral con- 
tinuum and brightest broad line, plotted as flux F\, units ergs 
cm - 2 s — 1 A — 1 , against A in Angstroms. These are spectra with 
correct redshifts from but misclassification as emission-line galax- 
ies. Two examples each are shown of fits to measure the FWHM 
of (a) Ha (b) H/3. 



rect redshift might be returned, but to do this the spectra 
have to be re-examined by eye. 

Although it will be impractical or inefficient to exam- 
ine by eye every spectrum, it will be possible to do this 
for the small subset of objects with evidence from multi- 
band optical-NIR photometry that they are at z > 8 ( i.e. 
they are Lyman-break objects which drop out of detection 
at A < 1.0-1. 1/xm). Photometric selection could be from the 
imaging component of the Euclid wide survey, which should 
be able to select z > 8 objects by Y — J colour down to 
J ~ 22 (Laureijs et al. 2011), and/or from ground-based 
optical-NIR surveys (UKIDSS, Vista Viking, Vista Hemi- 
sphere Survey etc.). 

On this basis we re-examine by eye all of the z > 8.06 
spectra where the automated fit gave wrong redshifts, and 
where possible re-fit excluding the obviously contaminated 
wavelength range (other than for the A range, no further 
restrictions are placed on fitting, i.e the full range of redshifts 
and templates are available). 

This method was very successful for spectra with a 
clearly visible Lyman-a line. We recovered correct redshifts 
and classification for 26 spectra at z > 8.06, which pre- 
viously had bad redshifts. This number includes 3 spectra 
where even after re-fitting zez is offset from Zi nput by 0.05- 



10 N. Roche, P.Franzetti, B.Garilli, G.Zamorani, A. Cimatti, E.Rossetti 



Specfit to Ha line for 0.67<z<2.04 spectra 




17 17.5 18 18.5 19 19.5 20 20.5 21 21.5 22 22.5 
H magnitude 

Specfit to Hp line for 2.04<z<3.1 spectra 





8000 




7000 




6000 


EE 






5000 


X 




S 




u_ 


4000 


CD 




T3 


3000 


CD 
tl 




Li. 






2000 




1000 








, hl classed as AGN 
* ' misclassified ' 



17 17.5 18 18.5 19 19.5 20 20.5 21 21.5 22 22.5 
H magnitude 

Specfit tc Mgll line for 3. 1<z<6. 128 spectra 



classed as AGN ^~ 
( c ) misclassified 



17.5 18 18.5 19 19.5 20 20.5 21 21.5 22 22.5 
H magnitude 

Specfit to Lyman-a line for z>8.1 spectra 




Figure 11. Measurements of the FWHM of the brightest visible 
broad line, using IRAF Specfit, for spectra with correctly mea- 
sured redshifts and both correct (AGN) and wrong classifications. 
Showing fits to (a) Ha (b) H/3 (c) Mgff (2800A) and (d) Lyman-a. 
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Figure 12. Some fainter (H > 20) examples of simulated spectra 
in the Lyman-a window for which the automated fit gave correct 
redshifts (zgz) an d classifications. 



0.1; these were included on the grounds that Lyman-a is 
still correctly identified (a more sophisticated fitting method 
might improve the redshift accuracy). However, re-fitting 
was rarely successful (only for 2 spectra) for spectra with 
H > 21.5 (generally too poor signal-to-noise ratio). Fig. 13 
shows some examples of contaminated spectra successfully 
re-fitted in this way. 

Objects at slightly lower redshifts (7 < z < 8.06) may 
also be selected by photometry, as z — Y dropouts. We re- 
examined the brighter spectra in this redshift range, but in 
only 2 cases were able to obtain correct redshifts by refitting, 
and notably these were both 'borderline' spectra at 8.02 < 
2 < 8.06, in which both the red wing of Lyman-a and the 
fainter CIV line were visible. 

Fig. 14 shows the positions of these 28 re-fitted spec- 
tra highlighted on the redshift-magnitude plot. If these are 
included the list of 'correct' QSO identifications increases 
from 156 to 184. 



5 QSO DETECTION RATE AS A FUNCTION 
OF MAGNITUDE AND REDSHIFT 

It is apparent from Fig. 14 that the limit to which QSO spec- 
tra can be correctly identified is very redshift-dependent, 
being deepest by far in the two windows where either Ha 
(0.67 < z < 2.05) or Lyman-a (z > 8.06) is visible to NISP. 
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Figure 13. Some examples of simulated spectra in the Lyman-a 
window, with contaminated regions (causing the discontinuities 
in the continuum level), for which the automated fit gave wrong 
rcdshifts (zez), but subsequent re- fitting (restricting the wave- 
length range to that indicated by the arrowheads) returned the 
correct redshift (2 re /;t) and classification. 



In both these ranges there are many correct identifications 
at H > 21, but at intermediate redshifts, there is none this 
faint. The intermediate range can be divided into two inter- 
vals, 2.05 < z < 3.0 where H/3 and [OIII] are visible, and 
3.0 < z < 8.06 where there are only the 'medium strength' 
broad-lines Mgll, CIII] and/or CIV to provide a redshift; 
QSO identification is a little better in the first. In the Ha and 
Lyman-a windows the detection rates fall to e _1 (37%) at 
H ~ 21.0, while for 2.05 < z < 3.0 this occurs at H ~ 19.6, 
and for 3.0 < z < 8.06, H ~ 19.3. In all ranges the detection 
rate does not cut off sharply, but declines over at least 1.5 
magnitudes, and for real AGN, with large variations in line 
equivalent- widths, this fall-off would be even more gradual. 

Dividing the spectra into these four redshift ranges, for 
each we calculate a detection rate as the proportion of (the 
extracted) spectra with both correctly (within ±0.05) mea- 
sured redshifts and classification as QSO (including the 8 
spectra reclassified using line FWHM), in A(H) = 0.5 mag 
intervals. A detection rate of unity is assumed brightward 
of the brightest simulated objects (e.g. H < 17), which may 
not be quite correct, but very few high- z QSOs will be this 
bright so this will not affect our conclusions. 

Fig. 15 shows the detection rates, and Table 2 gives 



the total (Nqso, extracted spectra only) and correctly de- 
tected numbers (Ndet, which does not include the 28 re- 
fitted spectra). If the 28 re-fitted z > 8 spectra discussed 
above are included, this gives the detection rates plotted as 
bold lines in Fig. 15, and the detected numbers in columns 
NdetR in Table 2. The detection rate in the Lyman-a win- 
dow is substantially increased by re-fitting, especially at 
21.0 < H < 21.5 (from 25% to 57%) although it still falls 
off steeply at H > 21.5. With the addition of re-fitting we 
predict that the QSO detection rates will be higher and with 
a deeper limit (effectively at least H ~ 21.5) for z > 8.06 
than in any other redshift range. 



6 PREDICTIONS FOR QSO DETECTION IN 
THE EUCLID NISP WIDE SURVEY 

6.1 How many QSOs will be found, including at 

2 > 8? 

Estimates can now be made of the total number of type 
1 AGN that would be identified in the entire NISP wide 
survey (15000 deg 2 ). Necessarily these will be dependent on 
the evolution model. Fig. 16 shows QSO number counts from 
our adopted model (Bongiorno et al. 2007 with k ev — —0.47 
at z > 2.7), now divided into 7 redshift intervals. These 
model counts are multiplied by our simulation's estimate 
of the QSO detection rate as a function of H and z, to 
give the detected-QSO counts (Fig. 17; for the detection 
rates both without and with the inclusion of the re-fitted 
spectra). These detected counts, summed over H for each 
redshift interval and multiplied by 15000 deg 2 , predict the 
total numbers that will be found in this area down to the 
faint limit of the survey (Table 3, columns 2 and 3). 

In total, we estimate the Euclid wide spectroscopic sur- 
vey will find and identify about 1.41 million QSO, but 
with the great majority (92%) in the Ha window. With 
kev = —0.47 evolution, the predicted number at z > 8.06 
(with automated redshift finding) is 22.27 (with the most 
distant one or two QSOs at z = 10-12). With the improved 
rate of identification following by-eye refitting, this number 
increases by 59% to 35.43. 

Using instead the LF of Willott et al. (2010a), which is 
normalized at z = 6 with evolution k ev — —0.47 to z > 6 
(this cannot be extrapolated to lower redshifts) gives the 
predicted counts in columns 4 and 5 of Table 3. These are 
a factor 1.75 lower, with 12.68 in the Lyman-a window (in- 
creasing to 20.05 with refitting). As discussed in Section 
2.3 our LDDE model gives a higher normalization at these 
redshifts (approximately that of the Jiang et al. 2009 LF). 
While at the time of writing the Willott et al. (2010a) LF 
is arguably the best estimate at z ~ 6, their Fig. 5 suggests 
that the true normalization for the bright end is probably in- 
termediate between the two functions. On this basis, and as 
it is certain that photometric selection of z > 8 objects will 
be available, we give a prediction for Lyman-a (z > 8.06) 
QSOs detectable in the Euclid wide survey as 20-35. 
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Figure 14. Results of EZ analysis of the simulated QSO spectra, shown on a plot of input redshift against H apparent magnitude. 
Symbols of 4 types indicate whether spectra have correct or incor- rect redshift measurements, and whether or not they have correct 
classifications (as QSOs). Additional symbols highlight the spectra (8) that were reclassified as AGN by line FWHM fitting, and the 
z > 8 spectra with contamination, for which refitting 'by-eye' gave correct redshifts (28). 
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Table 2. Number of QSO spectra included in the simulation Nqso (extracted spectra only) and the numbers with correct redshift and 
classification N det , in A(iT) = 0.5 mag bins of apparent magnitude and 4 intervals of redshift. The columns N det R (for the two highest 
redshift intervals only) show the (higher) numbers of detections with the inclusion of the redshifts obtained using the method of refitting 
spectra (to exclude contaminated regions; see Section 4.4). 
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Figure 15. The proportion of spectra correctly detected (in 
A(ff) = 0.5 magnitude bins, divided into four intervals of red- 
shift. Detection limits are deepest in the 'windows' where Ha 
(0.67 < z < 2.05) or Lyman-a (8.06 < z < 12) arc visible. For 
the highest 2 redshift intervals the detection rates are plotted 
without and with (bolder lines, same colours) the inclusion of the 
28 refitted rcdshifts. 
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Figure 17. The number counts from the model multiplied by the 
simulation's detection rate (as a function of H and z), calculated 
excluding and (as bold lines) including refits, to predict the count 
of correctly identified QSOs. 
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Table 3. The total number of QSOs in different redshift ranges 
detectable in a 15000 deg 2 wide Euclid NISP survey, for this pa- 
per's evolution model (with k ev = —0.47 negative density evolu- 
tion at all 2 > 2.7), and (at z > 6 only) the Willott et al. (2010) 
QSO luminosity function (W10) with k ev = —0.47 evolution at 
2 > 6. Shown for the detection rates estimated without and with 
re-fitting contaminated high-2 spectra. 



Figure 16. H-b&nd differential QSO number counts from our 
model, divided by redshift. 



However, it has been suggested (Steve Warren, private 
communication) on the basis that UKIDSS has so far found 
only 1 QSO at z > 6.5, that the negative $* evolution at 
these redshifts could be steeper with k ev ~ —0.72 (extrapo- 
lated from a Willott et al. LF at z = 6). Although current 
observations remain consistent with k ev = —0.47, there are 
models (e.g. Shankar et al. 2010; DeGraf et al. 2011) that 
predict a faster evolution at z > 6, approximately in this k ev 
range. With the Willott et al. LF and k ev — —0.72, the ex- 
pected count at z > 8.06 would be reduced by more than a 
factor 4, to 2.81 (4.43 with refitting), with very little chance 
of finding a z > 10 QSO. 

Whatever the evolution rate, the estimates in given in 
Table 3 should probably be subject to a downward correction 
of about 6% because of the small fraction of very weak-lined 
or lineless QSOs (see Section 2.1), which with the methods 
of redshift finding discussed in this paper would probably 
be missed. The prediction for Lyman-a detections is then 
reduced to 19-33. 



It might also be possible that at z > 8, the fraction 
of intrinsically weak-lined/lineless QSOs could increase sig- 
nificantly above that seen at 4 < z < 5, and/or that for 
2 > 8 QSOs the blocking of Lyman-a by HI in the IGM 
could be much greater than assumed in our model (e.g. the 
ionised zones are much smaller). If either of these occur it 
would lower the number of detections to some (unknown) 
extent. By the time Euclid is launched, the normalization of 
the QSO LF at z — 6 should be very well determined, and 
there may be further detections at z ~ 7, so the numbers in 
the NISP Lyman-a window will constrain the evolution at 
z > 7. With careful analysis of the highest redshift spectra 
(such as the refitting by eye tried here) to maximize de- 
tections, the survey should have good enough statistics to 
distinguish (at high significance) between a continuation of 
k ev ~ —0.47, and the k ev ~ —0.72 or steeper evolution of 
the maximally rapid (unity duty cycle Eddington-rate) ac- 
cretion models. In addition to giving an estimate of k ev , the 
redshift distribution of the detections will directly trace the 
evolution over 8 < z < 10-12. 
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Figure 18. The B-band absolute magnitudes for the 184 QSO 
spectra in our simulation which have correct redshifts (includ- 
ing refits) and classification. Most in the Lyman-a window have 
-28 < M B < -26. 



6.2 X-ray Fluxes of z > 8 QSOs 

It is of interest whether z > 8 QSOs will also be de- 
tected in forthcoming wide X-ray surveys. QSO X-ray lu- 
minosities can be estimated approximately from optical lu- 
minosities, Mb- Fig. 18 shows the B-band absolute mag- 
nitudes of the 184 simulated QSO spectra that would be 
detected (including the re-fits). The QSOs detectable (with 
Euclid) in the Lyman-a window have luminosities ranging 
from M B = -25.87 up to M B = -29.34 with most at 
-28 < M B < -26. All are above M* (M B ~ -25.81) and 
thus lie on the bright-end slope, in the same range of M B as 
the highest redshift (z ~ 6) QSOs found in the SDSS (e.g. 
Jiang et al. 2009) and the z = 7.085 discovery of Mortlock et 
al. (2011), for which M B ^ -27.2. Note that Euclid could 
detect z > 8 QSOs at least ~ 1 magnitude less luminous 
than ULAS J1120+0641. 

Lusso et al. (2010) investigate the UV to X-ray ratio for 
a large XMM-COSMOS sample of type 1 broad-line AGN, 
with a very wide redshift (up to z = 4 with z m ean = 1.66) 
and luminosity range. They fit a non-linear relation 

log L(2 keV) = 0.760 log L(2500A) + 3.508 
Assuming, from the SDSS QSO template, F„(2500A) = 
0.8789F„(B), and from Tozzi et al. (2006), that typical QSO 
spectrum at 0.5-7 keV is a power law F v oc z/ -0 ' 8 (photon 
index 1.8), we derive 

log L(0.5 - 2.0 keV) = 36.917 - 0.304M S in ergs s" 1 

Fluxes can be estimated from this luminosity, with the 
k-correction in log flux being 0.2 log (1 + z). 

Fig. 19 shows the predicted F(0.5 - 2.0keV) fluxes 
(observer-frame) for the simulation's detected QSOs, com- 
pared to 5a point-source detection limits in this band for 
the forthcoming eROSITA (extended Roentgen survey with 
an imaging telescope array) all-sky survey and deep fields 
(Capelluti et al. 2010), and for the all-sky and medium and 
deep surveys with the proposed Wide Field X-ray Telescope, 
WFXT (Brusa et al. 2010; Gilli et al. 2010). These 4 limits 
are respectively 1 x 10~ 14 , 2 x 10~ 15 , 3 x 10~ 15 , 5 x 10~ 16 
ergs cm _2 s _1 . 

It is predicted that z > 8.06 QSOs detectable with Eu- 
clid would generally have 0.5-2.0 keV fluxes 1-4 x 10 -15 ergs 
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Figure 19. The 0.5-2.0 keV X-ray fluxes predicted (from the M B 
magnitudes) for all (182) correctly identified QSOs in the simula- 
tion, compared with expected detection limits for the eROSITA 
all-sky survey and deep fields, and the wide and medium survey 
with the proposed WFXT. 



cm~ 2 s _1 . None of these (unless they have unusually high X- 
ray/optical ratios) would be detectable in the eROSITA all- 
sky survey. Some may be within the limit for the eROSITA 
deep fields at the ecliptic poles, which cover much smaller 
areas (~ 200 deg 2 total) but by chance might contain one. 

If the more sensitive WFXT is launched and attains its 
proposed limits, Fig. 19 suggests about i of Euclid-NISP's 
z > 8.06 QSO detections will be within reach of its all-sky 
survey. With k ev ~ —0.47 evolution this sample would in 
total be 20-35, so the Euclid-WFXT overlaps would be 3-5 
QSOs. All Euclid-detectable QSOs should easily within the 
limits for the WFXT medium survey, which is planned to 
cover a large area of 3000 deg 2 and so would include and 
detect a further 4-7 of the Euclid z > 8.06 QSOs. It will 
probably detect further z > 8 QSOs too faint for Euclid 
spectroscopy, but for which Euclid imaging would provide 
photometric evidence for z > 8. We conclude Euclid would 
be well matched with WFXT for studying the highest red- 
shift (z > 8) AGN (eROSITA appears not to be sensitive 
enough) . 



7 SUMMARY AND DISCUSSION 

(i) We simulate the detectability of type 1 QSOs in a wide- 
field near-infra-red slitless spectroscopic survey, using the 
parameters of the Euclid near-infrared spectrometer and 
imaging photometer (NISP). The spectra are represented 
using the SDSS composite QSO template of Vanden Berk 
et al. (2001), extended shortward and longward following 
Telfer et al. (2002) and Richards et al. (2006) respectively, 
and the simulation included QSOs at all redshifts out to 
z = 12. We add the effect of the (redshift-dependent) IGM 
absorption on the spectrum at \ re st < 1216A, using the 
model of Madau (1995). 

At the redshifts (z > 8.06) where the strong Lyman-a 
emission line enters the the NISP wavelength range (11000- 
20000A), the absorption from HI will be very strong, con- 
sequently Lyman-a in these spectra will appear asymmetric 
because of absorption on the blue wing and there will be 
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very sharp and deep spectra break at A rest ~ 1200A. How- 
ever, due to the 'bubble' of ionized hydrogen surrounding 
a high-z QSO (which is also represented in our model) the 
peak of the Lyman-a line should not be absorbed at all and 
will remain visible. 

(ii) In the simulation the QSO luminosity function and 
its evolution were modelled using the luminosity depen- 
dent density evolution model of Bongiorno et al. (2007). At 
z > 2.7 this was extrapolated by a simple exponential de- 
cline in density, <f> oc i(r - 47 (*- 2 - 7 ) (fc e „ = -0.47). As the 
proportion of very high redshift QSOs in any flux-limited 
sample will be extremely small, their numbers are scaled up 
by a large factor (10 ) in the simulation, while retaining 
the magnitude and redshifts from the model. The QSO simu- 
lation covers a 0.5 x 0.5 deg 2 area and is limited at H = 22.5. 
The effects of spectral contamination and overlap are repre- 
sented in part by the artificially high density of QSO spectra 
but also by mixing the QSOs with faint emission-line galaxy 
spectra. 

(iii) The 1468 QSO spectra that could be extracted from 
the AXESIM image were analysed with an automated redshift 
finder (ez), which initially found correct redshifts (within 
A(z) = ±0.05) and classifications (as QSO spectra) for 148. 
For a further 20 the redshifts were correct but the spectra 
were misidentified as non-AGN galaxies. The second step 
of our analysis was to fit the brightest lines with Gaussian 
profiles, and if the line FWHM > 2000 km s _1 , the spectrum 
(if initially classed as a galaxy), is reclassified as type 1 AGN. 
Using IRAF 'specfit', line FWHM could be measured well 
enough to confidently identify type 1 AGN to at least H ~ 
20.3 where Ha is visible and H — 19.3 using H/3, and we 
could reclassify 8 of the 20 misclassified spectra as QSOs. 

The successful detection rate (i.e. fraction of spectra 
with both correct redshift and classification) is highest, and 
detection reached deeper limits (H ~ 21-21.5) in the two 
redshift windows where either Ha (0.67 < z < 2.05) or 
Lyman-a (z > 8.06) is visible in the spectrograph A range 
(this is not surprising as these are by far the strongest lines 
in terms of flux and equivalent width). Detection limits are 
less deep where the redshift is provided by H/3 and [OIII] 
(2.05 < z < 3.0) and shallowest in the intermediate range 
(3 < z < 8.06) where the useful broad lines are Mgll, CIII] 
and CIV. 

(iv) In some cases the automated fit gave a wrong red- 
shift because of contamination affecting part (even a small 
part) of the spectrum. However, Euclid and/or ground-based 
optical-NIR imaging will provide an independent photomet- 
ric selection of z > 8 objects within the limits of the spec- 
troscopic survey, e.g on the basis of a drop-out in Y — J. 
All spectra flagged in this way could then be carefully ex- 
amined by eye and re-fitted, with exclusion from the fit of 
obviously contaminated wavelength regions (generally either 
at the blue or the red end). In this way it is possible to 
quickly and very significantly improve the QSO detection 
rate at z > 8. 

We performed new fits on the z > 8 spectra where au- 
tomated fitting had given wrong redshift measurements, re- 
stricting the fit's wavelength range (by eye) to exclude ob- 
viously contaminated regions while including the Lyman-a 
line. In this way we obtained the correct redshifts for a fur- 
ther 28 spectra, almost all at z > 8.06 and H < 21.5. For 



QSOs in the range 21 < H < 21.5, the inclusion of refit 
redshifts gave more than a doubling of the estimated QSO 
detection rate (from 25% to 57%). This method did not 
work well at H > 21.5 (signal-to-noise ratio too low) or at 
7 < z < 8 (Lyman-a not visible). Over all magnitudes, by- 
eye refitting would increase the sample of detected z > 8.06 
QSOs by a very substantial ~ 60%. With refitting, Euclid 
would now attain higher detection rates and deeper limits 
(effectively H = 21.5) for QSOs at z > 8.06 than in any 
other redshift range, thanks to the strength of the Lyman-a 
line and break. 

(v) Combining the QSO detection rate from our simu- 
lation with the LDDE model for QSOs, we predict that in 
the Euclid NISP wide survey, area 15000 deg 2 , a total of 
1.41 million type 1 QSOs will be found, with most (92%) 
in the Ha window. In the Lyman-a window (z > 8.06), 
with the LDDE model and k ev — —0.47 evolution we pre- 
dict 22.6 QSOs will be detected and identified. Using instead 
the Willott et al. (2010a) LF for z ~ 6 QSOs, again with 
k ev = —0.47 evolution, this falls to 12.7. We took these two 
estimates as spanning the probable range of uncertainty in 
the (bright-end normalisation of the) QSO luminosity func- 
tion at z ~ 6. With refitting contaminated spectra, Euclid's 
detection rates at these redshifts are ~ 60% higher, and 
the range of prediction for z > 8.06 QSOs increases from 
12.7-22.6 to the range 20-35. This reduces again slightly to 
19-33 if a small down-correction is applied for the 6% frac- 
tion (from Diamond-Stanic et al. 2009) of very weak-lined 
and lineless QSOs. Out of this number, we predict Euclid's 
highest redshift detection is likely to be at z ~ 10-11. 

However, the numbers of QSO detections at z > 8.06 
would reduce to only 4 or 5 if the evolution at z > 6 steep- 
ens to k ev ~ —0.72, which is approximately as predicted 
by models with Eddington-rate accretion and a duty cycle 
of unity (e.g. Shankar et al. 2010). The Euclid wide survey 
will give good enough statistics on z > 8 QSOs to distin- 
guish between these two possibilities, estimate k ev and set 
important constraints on the early evolution of AGN. 

Any QSOs detectable at z > 8.06 would lie on the 
bright-end slope of the AGN luminosity function (—26 > 
Mb > —28), with SMBH masses ranging from at least 
~ 10 8,5 Mq, on the basis of, e.g. the Mbh estimates of 
Willott et al. (2010b), up to the - 2 x 1O 9 M of ULAS 
J 1120+0641. Their discovery at z > 8 epochs would fur- 
ther challenge models to explain their early/rapid formation 
and may favour larger masses for the seed black holes (e.g. 
~ 10 Mq as in the nuclear star cluster collapse scenario of 
Davies, Miller and Belovary 2011). 

(vi) Using the QSO X-ray to UV relation of Lusso et al. 
(2010), we estimate the X-ray fluxes of the i?iidid-detectable 
z > 8 QSOs would typically be F(0.5-2.0keV) = 1-4 x 10~ 15 
ergs cm~ 2 s _1 , too faint for the eROSITA all-sky survey to 
detect. However, some of these AGN would be detectable 
on the proposed WFXT wide survey, and all would be 
within the flux limits of the WFXT medium survey. With 
k ev = —0.47 evolution we predict a z > 8.06 overlap between 
Euclid wide and WFXT wide of 3-5 QSOs, and of 4-7 QSOs 
between Euclid wide and the 3000 deg 2 WFXT medium sur- 
vey; evidently the two telescopes are well matched for the 
highest redshift QSOs. 
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